Robust Joint Precoder and Equalizer Design in 
MIMO Communication Systems 

Saeed Kaviani and Witold A. Krzymieri 

University of Alberta and TRLabs, Edmonton, Alberta, Canada T6G 2V4 

E-mail: {saeedk,wak}@ece.ualberta.ca 



CO . 

O' 

<n : 

Of 

<: 



CO 



> 

CN 

^t 

^t 

o 
m 



x 



Abstract — We address joint design of robust precoder and 
equalizer in a MIMO communication system using the mini- 
mization of weighted sum of mean square errors. In addition 
to imperfect knowledge of channel state information, we also 
account for inaccurate awareness of interference plus noise 
covariance matrix and power shaping matrix. We follow the 
worst-case model for imperfect knowledge of these matrices. 
First, we derive the worst-case values of these matrices. Then, we 
transform the joint precoder and equalizer optimization problem 
into a convex scalar optimization problem. Further, the solution 
to this problem will be simplified to a depressed quartic equation, 
the closed-form expressions for roots of which are known. Finally, 
we propose an iterative algorithm to obtain the worst-case robust 
transceivers. 

I. Introduction 

Deployment of multiple antennas promises significant ca- 
pacity gains in wireless systems |Qj-[4|, which motivates 
construction of pragmatic signalling strategies exploiting these 
gains. Nevertheless, these improvements are severely degraded 
by interference [5 1. Therefore, combination of precoding at the 
transmitter and equalization at the receiver is often employed 
to reduce the interference and maximize the performance of 
system. In this setup, linear strategies have attracted more 
attention due to their simplicity and robustness. Although 
various multiple-input multiple-output (MIMO) linear pre- 
coding and equalization methods have been proposed ||6l- 
[|9l , they assume that the channel state information (CSI) 
and interference plus noise covariance matrix are perfectly 
known at the transmitter and receiver. Unavailability of exact 
information of channel matrices may diminish the performance 
of the transceivers significantly. This motivates studying of the 
robust linear transmitter and/or receiver design problem [fTOl - 

ED. 

There are normally two philosophies to consider imperfect 
CSI and to design robust transmission strategies: Worst-case 
deterministic scenario and stochastic scenario. In stochastic 
scenario, the CSI errors are modeled probabilistically and the 
average performance is optimized ll20l - ||23l . In this paper, we 
consider the worst-case scenario since it can characterize the 
instantaneous imperfect knowledge of system matrices. In this 
approach, the actual system matrices are assumed to lie within 
a so-called uncertainty region around the estimated values 
known by the transmitter. A worst-case robust design is a 
design which achieves a particular performance level for any 
channel realization staying in the corresponding uncertainty 
region ED-ED. El- 



Worst-case robust transceiver design has been recently con- 
sidered in ||T2l . lfT31 . In 11121 . joint optimization of transceivers 
is addressed using semi-definite program (SDP) reformulation. 
However, even for the case of perfect channel knowledge it 
only gives a suboptimal solution [25]. Moreover, SDP-based 
approaches (see for example iflCfl . 0Tj) do not give a closed- 
form solution and the resultant algorithms require solving a 
SDP at each iteration. Closed-form solution for the worst-case 
robust MMSE precoder assuming pre-fixed equalizer is given 
in [ 13 1 and it is extended to design of worst-case robust MMSE 
transceivers in |fT31 . Nevertheless, the proposed algorithm 
is based on alternative optimization between precoder and 
equalizer (rather than joint optimization). It also involves 
solving a quintic equation, for which a closed-form solution 
is unknown and it is only solved numerically. 

Conventionally, the robust transmission strategies in the 
MIMO communication systems consider imperfect knowledge 
of channel gains between the transmit and receive antennas. 
Here, we consider a wider range of system parameters all 
known inaccurately to the system. In addition to CSI, we 
consider imperfect knowledge of interference plus noise co- 
variance matrix, and power shaping matrix. Moreover, our 
objective is weighted sum of mean square error minimization 
which is a more general performance metric [I261 . We avoid 
SDP reformulation to solve the problem and consequently the 
proposed algorithm has lower complexity. First, we obtain 
the worst-case system matrices within the uncertainty region. 
Then, we jointly optimize precoder and equalizer where the 
resultant matrix-valued optimization problem is reduced to a 
scalar convex problem. Further, the solution to this problem 
can be simplified to a depressed quartic equation. Interest- 
ingly, the solution of a quartic equation can be expressed in 
terms of radicals. Hence, a closed-form expression for the 
precoder and equalizer matrices can be obtained. Finally, we 
propose an iterative algorithm to find the optimal transceivers. 
Note that the analysis and design approach for point-to-point 
MIMO systems presented in this paper can be extended to 
multiuser MIMO systems |27l . 

Notation: We denote positive semi-definite matrices as A ^ 
0. Capital bold letters represent matrices and small bold letters 
represent vectors. We denote conjugate transpose (Hermitian) 
operator with (•) . A~2 represents the inverse square root 
of positive definite matrix A. tr{-} is matrix trace operator 
and || • || demonstrates Frobenius norm of a matrix. A max (-) is 
reserved to denote the maximum eigenvalue of a matrix. 



II. System Model and Preliminaries 

We consider a MIMO communication system, where the 
transmitter is equipped with n t antennas and the receiver 
employs n r antennas (n r < n t ). The transmitter broadcasts 
a data vector denoted by u G C" r using the linear precoding 
matrix F G C 1 *""', The channel between the transmitter and 
the receiver is characterized by the matrix H G C" rXn '. The 
receiver observes the signal 



y = HFu 



(1) 



where n represents the correlated interference plus noise vec- 
tor. We denote the interference plus noise covariance matrix as 
fi = E [nn H ] G C 7lrXrir . The linear processing at the receiver 
can be characterized by the equalizer matrix G G C n '' x ™ r . 
Hence, the estimated symbol vector at the receiver can be 
described as 



u = Gy. 



Let us define the estimation error covariance matrix as 



(2) 



E =E 



(u - u) (u - u) 



=GHFF H H H G H GHF F H H H G H + GOG H + I. 

(3) 

which is referred to as mean square error (MSE)-matrix (8). 
We are specifically interested in the following problem: 



tr {WE} 
subject to tr {*FF H } < P 



minimize 

G.F 



(4) 



where the optimization is over precoding and equalization 
matrices with given diagonal weight matrices W <G C nrXUr 
where the main diagonal of W is denoted by [w\ , ..., w nr ] with 
non-negative weights Wj > 0. Since the diagonal elements 
of the MSE-matrix are MSE values of the estimated symbol 
vector, the problem (@]i is often called weighted sum of mean 
square error minimization (WMMSE) problem (see [25 1, |26|, 
||28l , ||29l for details). We also account for a linear power 
constraint tr{<I?FF H } < P and specifically refer to the 
weight matrix 3? in the power constraint as the power shaping 
matrix. This matrix also can characterize the direction, in 
which the transmitted power can propagate while reducing 
the interference in other directions (e.g. to other users in a 
multiuser case). Additionally, we assume that the matrix *& 
is full rank and square with size of n t . This assumption is a 
practical constraint due to the fact that if $ is a rank deficient 
matrix then one can always transmit infinite power in one 
direction without violating the power constraint. Notice that 
when $ = I, the sum power constraint emerges. We refer to 
the matrices H, $, and $1 as system matrices. 

It is shown that any performance metric characterized by 
some particular function / (E) of the MSE-matrix E, can 
be approximated using the problem (|4]i |J6), ||251 . 11281 . The 
approach is that at each iteration, we select W = Ve/(E) t 
at the operating point E, then solve the optimization problem 



dULj. The algorithm iterates until the convergence is achieved. 
For example, to adopt sum rate maximization one can select 
W = E _1 at each iteration. This approach has been ex- 
tensively used in 0, (25), (26), Eg], ED to optimize any 
performance function of the MSE-matrices (e.g. sum-rate), 
since the resultant problem becomes convex by fixing any 
optimization variable. 

III. Perfect Knowledge of System Matrices 

We begin with the case of perfect channel knowledge 
which will be employed for the robust design. J26) . ll28l 
have considered WMMSE problem with perfect CSI, but their 
solutions for each of the precoding and equalization matrices 
are interdependent (requires alternative optimization). Here, 
we give an extension of the results in |6|, [8| when power 
shaping matrix is present. Detailed discussion of this problem 
with more general constraints is available in ll25l . Here, a 
special case of this result where the number of transmitted 
data streams is equal to n r is given. 

Lemma 1 i l25l/ : For any channel matrix H and given the full 
rank and square matrices 3? and $7, the optimum precoding and 
equalization matrices of the problem (@J have the following 
structure 



F=*-2V£, 

G =AU H $W 



(5) 
(6) 



where X and A are diagonal matrices with the diagonal 
elements oi > and Ai > 0, i = 1, ...,n r , respectively. 
U <G C" r xnr and V G C" f x ™ r are obtained by performing the 
singular value decomposition (SVD) of the following matrix 



n m* 



u[r o„ r x„ t -„ r ] v v 



(7) 



in which T contains its n r nonzero eigenvalues 71 > . . . > 
7„ r and V G (£«** («*-«,-) contains the right singular vectors 
corresponding to the zero eigenvalues^. 

Proof: The proof can be found in l27l . □ 

IV. Imperfect Knowledge of System Matrices 

In this case, only estimated matrices H and $1 and 4>, are 
available at the transmitter and receiver. Therefore, the actual 
value of these matrices can be described as a sum of the 
estimated matrices and the error matrices: 



n = n + An, 

$ = $ + A$. 



(8) 

(9) 

(10) 



We are interested in the joint optimization of the precoder 
and equalizer, while the unknown actual system matrices 

'The optimal forms of precoder and equalizer diagonalize the MSE- 
matrix E [25] and consequently the updated weight matrix W is diagonal. 

I — i -i 

~The matrix fi ^M 2 with probability one has a rank of n r , due to 
the random nature of the channel matrix H and the fact that n r < nt. 



are guaranteed to fit in the (norm-based) uncertainty region. 
Hence, the error region can be described as 

W = {(A H ,Ao,A$) : ||A ff || <e Hl ||An|| < en, 

n + AohO,||A«|| <£*,* + A* ho}. (11) 

Consequently, the worst-case transceiver design can be ex- 
pressed as 

minimize max tr {WE} 

F,G (A H ,A*,A n )SW (12) 

subject to tr {*FF H } < P. 

Please note that we consider a case, in which the uncertainty 
of the system matrices is the same at the transmitter and the 
receiver. We leave the scenario, under which the uncertainty 
of the system matrices at the transmitter is much higher than 
at the receiver to future work. 

A. Finding Least Favorable System Matrices 

We proceed by finding the worst-case estimation errors for 
the system matrices. First, we expand the objective function of 
(fTZt in terms of the estimated and error system matrices using 
the definitions (O and <T8Tt-(fTQb and simplify the worst-case 
problem as 



(A 



maximize tr (we! + tr {G H WGA i 

jf,A s ,Afi)eW I J 

+ tr {AA ff BA^} + 2<Ke {tr {CA H }} 

subject to tr [ <j?FF H j < P - tr { A<i,FF H } 

(13) 
where E is the MSE-matrix defined in OJ based on the 
estimated matrices H and O and 



A =G H WG, 

B =FF H , 

C =FF H H H G H WG FWG. 



(14) 
(15) 
(16) 



Least Favorable Matrices Ajj and A$: Since the error 
matrices are independent of each other, the least favorable 
interference plus noise covariance matrix can be obtained from 
the problem 



maximize tr { G H WG A q } 

||An||<«i _ L ' 

subject to O + A n y 0. 



(17) 



We assume that £q is small enough to ignore the positive 
semi-definite condition of the problem (fTTT i. Nevertheless, we 
will see that this relaxation gives us a solution, which also 
satisfies a positive semi-definite condition. Using Cauchy- 
Schwartz inequality, we can obtain 



tr{G H WGA } < ||G H WG|| • ||A n || < £o||G H WG| 



and the upper bound occurs when 

G H WG 



A* 



£Q1 



G H WG| 



(18) 



(19) 



We continue by finding the worst-case estimation error of 
the interference direction matrix, i.e. A$. It is trivial that the 
worst-case happens when the maximum allowed power is min- 
imized. Consequently, we are interested in this optimization 
problem: 

maximize tr{A$FF H } 

l|A*||<e* _ (20) 

subject to * + A$ y 

Similarly to the problem dl.71 . we can obtain the worst-case 

error matrix as 

FF H 

A ** =£ *IJFF1- (21) 

Substituting these worst-case estimation errors AJ and Aq 
into the problem (fT2l results in the terms £q||G h WG| and 
e$||FF || . Since we are interested in the worst-case scenario, 
we can use upper bounds of these terms alternatively. Hence, 
we can write inequalities 

en||G H WG|| <en||W*G|| 2 = e n tr {G H WG} (22) 
e#||FF H || <£*||F|| 2 =e$tr{FF H }. (23) 

where we have used the inequality ||XY|| < ||X|| • ||Y|| 
which can be proved by utilizing the Cauchy-Schwartz in- 
equality OTI . Since the Frobenius norm is invariant under the 
Hermitian operation, we get ||XX H || < ||Xj| 2 . Note that the 
approximations given in (l22l and d23l give an upper bound 
of the objective function of ( fT2b . By minimizing this upper 
bound, we also minimize the objective function. 

Now, we replace the terms tr{G H WGA^} and 
tr {FF H A$} in the robust transceiver problem (IPTt with the 
upper bounds defined in d22l and (l23l respectively. This is 
equivalent to setting 






(24) 
(25) 



With these fixed worst-case matrices, the robust transceiver 
design problem reduces to 

minimize max tr {WE} 

F,G ||A H ||<e« 

subject to tr{*FF H }<P (26) 

O = n + ejjl, * = $ + e$I. 

Least Favorable Channel Error Matrix A#: Now, we 
find the worst-case channel estimation error Ajy following 
the maximization problem for given $1 and 3>: 

maximize tr {AA H BA^} + 25He{tr{CA H }}. (27) 

l|Aff||<£H- 

Lemma 2: The least favorable channel estimation error for 
any given $ and ft has the following structure: 



A^ = fJ2UAV H *i 



(28) 



where U e C n - Xn - and V e C n - Xnt are defined in the SVD 

o 1 H 



o-2 H *-2=ur o„ rXnt _ nr v v 



(29) 



which gives the worst-case estimation error matrix A^. 



and A £ I" rX " r is a diagonal matrix with elements Si > 0. 



Proof: Due to limited space of this paper the detailed 
proof is included in [27 1. In order to explain our proposed 
transceiver optimization algorithm, here we only summarize 
the approach and results. The problem (|27) can be categorized 
as a trust-region subproblem 1132) . Il33l . The matrix-form 
restatement of this problem is given in H131 . It has been shown 
that the solution to this problem can be found by a minimiza- 
tion problem over an auxiliary variable •& > A max (A)A max (B) 
|32|, [ 33 J . The worst-case channel matrices coincides with the 
structure of the precoding and equalization matrices given in 
(|5) and (|6) using the worst-case interference plus noise and 
power shaping matrices defined in (|24) and ((25) ■ As a result, 
(5,s are given by 



? _ Wi\ i O-i(~1i\i<T l - 1) . _ 







w. 



',<y- 



(30) 



Note that 7^,1 = 1 , . . . , n r are the diagonal elements of T 
in ( |29) . Recognizing j = argmax^ (wiX 2 a 2 ), if -d > WjX 2 a 2 , 
then $ is the root of equation 



y^ wf X 2 u 2 (q l X l a l - l) 2 



'H- 



(31) 



£g 



where eh = 1 r 

l| ntii .||#g|| 

from equation ( f30b . Let 



If $ = WjX 2 a 2 , Sj cannot be found 



p[fi) 



i£ ($-w t X 2 a 2 f 



(32) 



Therefore, if p(wjX 2 a 2 ) < %\, then Sj = — y/s 2 ^ — p{&)- 
Otherwise, § > WjXjOj and it can be uniquely determined by 
(ED- □ 

B. Robust Transceiver Design 

Now, we can use the worst-case system matrices descrip- 
tions ( |24) . d25l ), and ((28) and substitute into the robust 
transceiver design problem. Note that using the trust-region 
subproblems [32), lt33l the resultant problem of finding worst- 
case channel estimation error A# becomes a minimization 
problem over an auxiliary variable d. The result can be 
compiled as follows: 

Theorem 1: The robust precoding and equalization matrices 
have the following structure: 

_ \_ 

(33) 

(34) 

where 

(i) U € C nrX ™ r and V e C n ' x " T are orthonormal matrices 
defined by the thin SVD 1341 



F = ( * + e»1 1 VE 
G =AU H (n + e n ll 



n + enl H * + e*I 



=urv 



H 



(35) 



where T e C" rX " r is a diagonal matrix with diagonal 
elements of 74 > 0, 



(w) A and £ are diagonal matrices of size n r with the 
diagonal elements of Xi,i — 1, ... ,n r and Cj, i = 1, . . . , n r , 
respectively and they are obtained through solving the scalar 
optimization problem 



l<i<n r 

subject to $ > WiXfcrf, i = 1, . . . , n r 



(36) 



fn'/J The optimum solutions for A, and Ui can be obtained 
with respect to d and a Lagrangian multiplier p as follows 



A,; —\ X 




(37) 



(38) 



where X , is a positive real root of a depressed quartic equation 

Pipf) ^y/pw 2 X i - (2Wii?VM + Wiyffi-yityX 2 

{^$ + w i )^>&X + $ 2 { y f]i- li Jfii)=Q. (39) 

If there is no real positive root, then Xj = 0. The closed- 
form solutions for the roots of the quartic equation ((39) can 
be obtained using the Ferrari's method ll35l and can be found 
in |27l . 

Proof: Due to limited space of this paper the detailed 
proof is included in 1(27) . Here, we summarize briefly the 
approach used. We first substitute (124) . (125) . and (|28) into 
the original problem (Q~2), and hence we simplify it to a 
minimization problem with respect to F, G and the auxiliary 
variable d. Now, using Lemma 1, the optimal expressions for 
the precoding and equalization matrices are given by (0 and 
(|6) for any values of error matrices. Substituting expressions 
for F and G and the worst-case system matrices (124) . (|25) . 
and (128) into ( fT2) . we can convert the problem into a scalar 
optimization problem, which can be simplified to (136) . Notice 
that the maximization preserves the convexity, therefore this 
problem is a convex optimization problem with respect to G 
and F and consequently in Xi and <ii,i — 1, . . . , n r . By fixing 
•&, we can solve the problem in Xi and a^. Next, the auxiliary 
variable $ will be updated following the Lemma 2. □ 

Our closed-form solutions are functions of the auxiliary 
variables $ and p. Using dual decomposition concept from 
[36 1, we can decompose the problem with outer loop optimiza- 
tion problems with respect to p and d. These values can be 
updated using a subgradient algorithm [37|. By differentiating 
the objective function in problem (136) with respect to d, we 
can obtain the subgradient direction for d as 



2^2 
3 



'2 y m i A i g i (A i (T i 7 i -l) 7 9\„,.\2 rr 

P {d) 



(40) 



?2 
-H 



V = w 3 X 2 o 2 



Similarly, by differentiation of the Lagrangian function of 
\, we can get the subgradient direction for p as A p = 



TABLE I 

Robust Transceiver Optimization Algorithm 

Initialize <t;s and A^s and fj, > 0, -d > rnax^ (wiXta'f}. 
Perform thin SVD 1351 to obtain 7js. 
Repeat (subgradient loop of i9) 

Update -d <- -d + 8$ A# using (40) . 
Repeat (subgradient loop of ^) 

Form the quartic equation l |39t for i = 1, . . . ,n r . 

Find its positive real root. 

Find Oi and A^ using j37t and (38). 

Update /i ^— ft + <5 M A M . 

Until satisfaction of )31> 

Replace A;s and <TjS into (33) and (34) and find F and G. 



S™=i °f — P- The robust transceiver optimization algorithm 
is summarized in Table U The algorithm consists of two loops. 
The inner loop solves the convex scalar problem with respect 
to \ and <jj and therefore it is convergent. The outer loop 
updates the auxiliary variable d using a subgradient method, 
which is based on the strong duality of the trust region 
subproblem ll32l . Il33l and consequently it is also convergent. 
The objective function of problem (|36| | is bounded and it is 
reduced in each iteration. Therefore, the algorithm in Table U 
is convergent. 

V. Numerical Results 

In this section, the performance of robust transceivers is 
evaluated numerically. The robust design guarantees a per- 
formance level for any point within the uncertainty region. 
Hence, the performance is displayed by the worst-case sum 
of MSE values, which are averaged over different system 
realizations. Each system realization is a result of a random 
generation of elements of the estimated system matrices (i.e. 
Hjfi?,*?^ which are i.i.d. Gaussian with zero mean. The 
uncertainty region is characterized by a parameter < e < 1. 
In our simulations, it is assumed that yi is the radius of the 
uncertainty region for each of the system matrices, when they 
are normalized by their Frobenius norms (i.e. e 2 H = g"||H|| 2 , 



ek=s\\n\ 



and 4 =e||*| 



). The non-robust transceivers 



consider the estimated system matrices as the actual system 
matrices and are discussed in Lemma 1 and ll25l . The worst- 
case estimation error matrices have been given in Section |IV] 
Note that only a solution of a special case of our problem is 
available in the literature, which includes uncertainty of the 
channel matrices H only. For this special case, our algorithm 
performs as the algorithm in [15| while it is less complex (by 
optimizing the precoder and equalizer jointly and reducing 
the problem to a quartic equation). It is shown for the special 
case (uncertainty of H) that our algorithm performs as well as 
SDP methods with much lower complexity using an iterative 
approach. 

Fig. Q] shows the comparison of robust and non-robust 
designs for different values of e, i.e. the size of uncertainty 
regions. The sum-MSE of the transceivers obtained with 
perfect knowledge of system matrices is also given as a 
baseline. Fig. [2] explicitly illustrates the performance of the 
robust and non-robust design with respect to the size of the 




Fig. 1. Comparison of the proposed robust design, the non-robust design 
[27], and the transceiver design when system matrices are perfectly known 
(perfect CSI) for nt = n r = 2. 
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Fig. 2. Comparison of different transceiver designs with respect to the size 
of uncertainty region s for nt = n r = 2. 



uncertainty region. As expected, the performance of the robust 
transceivers degrades at a much lower pace with increase of 
signal-to-noise-ratio (SNR) and the size of uncertainty region 
e compared to the non-robust transceivers. 

VI. Conclusions 

We have designed the robust transceivers when the channel 
matrix, interference plus noise covariance matrix, and power 
shaping matrix (system matrices) are all imperfectly known to 
the transmitter. The closed-form expressions for the precoder 
and equalizer have been found. This involves finding the worst- 
case system matrices first and then simplifying the problem 
to a scalar convex form. The solution to this optimization 
problem can be described in a form of a depressed quartic 
equation, the closed-form expressions for roots of which are 
known. Finally, we have proposed an iterative algorithm to 
obtain robust transceivers, which is significantly less complex 
compared to SDP-based alternating optimizations. Moreover, 



accounting for imperfect knowledge of all system matrices 
enables for the extension of our approach to the multiuser 
scenario, which is the subject of our current work [271 . 
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